DiscoverHuggingFace 每日AI论文速递2025.10.21 | 模型不懂光影折射;小模型也能写报告
2025.10.21 | 模型不懂光影折射;小模型也能写报告

2025.10.21 | 模型不懂光影折射;小模型也能写报告

Update: 2025-10-21
Share

Description

本期的 13 篇论文如下:

[00:21 ] 🪞 PICABench: How Far Are We from Physically Realistic Image Editing?(PICABench:我们离物理真实的图像编辑还有多远?)

[01:04 ] 🤖 DeepAnalyze: Agentic Large Language Models for Autonomous Data Science(DeepAnalyze:面向自主数据科学的智能体大模型)

[01:50 ] 🗜 Glyph: Scaling Context Windows via Visual-Text Compression(Glyph:通过视觉-文本压缩扩展上下文窗口长度)

[02:23 ] 🔍 Towards Mixed-Modal Retrieval for Universal Retrieval-Augmented Generation(面向通用检索增强生成的混合模态检索研究)

[03:10 ] 🔗 When to Ensemble: Identifying Token-Level Points for Stable and Fast LLM Ensembling(何时集成:定位Token级位置实现稳定高效的大模型集成)

[04:09 ] 🎯 Annotation-Efficient Universal Honesty Alignment(注释高效型通用诚实对齐)

[04:49 ] 🖌 Uniworld-V2: Reinforce Image Editing with Diffusion Negative-aware Finetuning and MLLM Implicit Feedback(Uniworld-V2:借助扩散负感知微调与MLLM隐式反馈强化图像编辑)

[05:46 ] 👁 RL makes MLLMs see better than SFT(强化学习让多模态大模型看得比监督微调更清楚)

[06:33 ] 🚀 Visual Autoregressive Models Beat Diffusion Models on Inference Time Scaling(视觉自回归模型在推理时扩展上击败扩散模型)

[07:09 ] 🎨 ConsistEdit: Highly Consistent and Precise Training-free Visual Editing(ConsistEdit:面向MM-DiT的高一致免训练视觉编辑)

[07:56 ] 🔄 Deep Self-Evolving Reasoning(深度自演化推理)

[08:22 ] 🧠 Beyond Pipelines: A Survey of the Paradigm Shift toward Model-Native Agentic AI(超越流水线:模型原生智能体AI范式转移综述)

[09:07 ] 🔮 Chronos-2: From Univariate to Universal Forecasting(Chronos-2:从单变量到通用预测)

<figure></figure>

【关注我们】

您还可以在以下平台找到我们,获得播客内容以外更多信息

小红书: AI速递

Comments 
In Channel
loading
00:00
00:00
x

0.5x

0.8x

1.0x

1.25x

1.5x

2.0x

3.0x

Sleep Timer

Off

End of Episode

5 Minutes

10 Minutes

15 Minutes

30 Minutes

45 Minutes

60 Minutes

120 Minutes

2025.10.21 | 模型不懂光影折射;小模型也能写报告

2025.10.21 | 模型不懂光影折射;小模型也能写报告